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E-MAIL FILTER AND METHOD THEREOF 



FIELD QF THE; IWEflTIOfr 

The present invention relates to an electronic or 
e-mail filter system as well as a method of filtering unwanted 
e-mail messages. 
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BACKGROUND O F THE INVENTION 

No cogent argument can legitimately be made refuting 
the fact that technology while generally benefitting mankind, 
does have its occasional deficiencies. This is certainly true 
with respect to the communications industry. Unfortunately , each 
technology advancement relating to the ease and facility of 
providing communications between various individuals or companies 
have created minor headaches or problems. Although used 
sporadically since the early 1920 's the utilization of the 
airplane in the mail industry since the end of World War II 
allowed individuals and communities on both the east and west 
25 coast to be linked with one another. Mail sent from New York to 
Los Angeles would be received within two or three days from the 
date that the communication was originally posted. Although the 
use of airmail had a salulatory affect upon the communication 
between individuals and other entities, various companies seized 
upon this relatively inexpensive means of communication to 
inundate the public with a large number of junk mail solicita- 
tions. Unfortunately, to the chagrin of many of these junk: mail 
operators, the public could generally determine which mail was 
important and which was not, based upon a number of factors such 
35 as the type of envelopes which were utilized, the return address 
of the sender as well as the manner in which the sendee was 
addressed. Therefore, many of these solicitations were never 
opened and were merely discarded. 

The deregulation of the telecommunications industry as 
well as the increased usage of "800" type numbers has resulted 
in an increased number of unwanted telephone solicitations. 
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While tending to be an annoyance, once the called party, 
determines that they are not interested in any solicitations or 
the type of solicitations offered by the caller, the called party 
can merely hang up his or her receiver. 
5 Increased use of facsimile machines in both the work 

environment as well as personal facsimile machines at home 
created another avenue for unwanted solicitations. Since 
technology allowed a single letter of solicitation to be 
transmitted to a large number of facsimile machines with ease/ 

10 it is easy to see that facsimile machine solicitations became an 
annoying problem, particularly when the individuals machine was 
receiving a large number of correspondences utilizing the 
receivers own paper. Furthermore, these solicitations were tying 
up the users telephone line so that important messages were 

15 delayed or never received. Due to an outcry by the public, 
legislation was passed to forbid these types of unsolicited 
communications directed to facsimile machines. 

The explosion in the personal computer "PC" industry 
has provided solicitors with yet another manner of sending 

20 unsolicited messages. More and more businesses as well as 
individual users are connected to one another over the Internet 
and Intranet 11. Similar to the situation with respect to 
facsimile machines, a solicitor can compose a message and send 
it on the Internet and Intranet 11 to a relatively large number 

25 of personal computers. Although these e-mail messages are not 
necessarily reproduced on paper in the manner that the facsimile 
messages were previously received, the receipt of these messages 
would prevent other legitimate messages from being received in 
a timely manner. Therefore, it is clear that a system and method 

30 of filtering unwanted e-mail messages must be developed to shield 
the PC user from the annoyance of unsolicited junk e-mail. 

U.S. Patent 5,619,648 issued to Canale et al is 
directed to a technique for reducing the amount of junk e-mail 
received by a user in an e-mail system. As illustrated with 

35 respect to Figure 1 of the Canale et al patent, a user 105 who 
wishes to reduce the amount of junk e-mail which is received, 
would be provided with a mail filter 109. A mail item 119 in the 
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system would include a standard e-mail message as well as a 
recipient specifier 121 which uses non-address information to 
further describe the recipients who would receive the e-mail as 
well as a referral list 127 which is a list of potential 
5 recipients who pass the e-mail on and of recipients to whom the 
e-mail was provided. The recipients specifier 129 also includes 
a recipient description 125. If the recipient description 
specifies a recipient which is of the same kind as that specified 
by the user model 113 , the mail filter 109 adds the mail item 119 

10 to filtered mail 115. The mail filter 109 can utilize the 
information in the referral list 127 to indicate a chain of 
referrals which resulted in the message being directed to the 
user 105. While this system can be utilized to reduce a users 
junk e-mail, it does not necessarily include a filter technique 

15 in which mail sent by a sender included in an approved guest list 
filter would be designated as such when received by the user. 
Additionally, this system is not utilized in a manner allowing 
an updated master list of junk e-mail addresses or senders to be 
developed and transmitted to other users in the system. 

20 U.S. Patents 5,093,918 issued to Heyen et al; 5,283,856 

issued to Gross et al; 5,377,354 issued to Scannell et al; 
5,632,011 issued to Landfield et al and 5,634,005 issued to 
Matsuo are all directed to various systems for sorting and 
managing electronic mail or similar messages. However, similar 

25 to the Canale et al patent, these patents do not describe a 
method or system in which electronic e-mail can be effectively 
filtered by the user as well as compiling an updatable master 
list of unwanted e-mail transmitters which is then transmitted 
to the end user for filter purposes. 

30 

SUMMARY OP TOP INVENTION 

The present invention overcomes the problems of the 
prior art by utilizing a method and system for filtering unwanted 
junk e-mail sent to the user's computer. The user would include 
35 various addresses or other defining characteristics in a "No 
Admittance List" as well as a plurality of addresses in a "Guest 
List". An incoming e-mail whose addresses are included in the 
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"No Admittance List" would be immediately discarded. Any address - 
in the "Guest List" would be immediately forwarded to an "In 
Box". Any address not included in either the "No Admittance 
List" or the "Guest List" would be forwarded to a "Waiting Room". 
The user would periodically review the e-mail included in the 
"Waiting Room" , Based upon this review, the user would either 
discard the e-mail to a "Trash Bin" or would send the e-mail to 
the "In Box". The addresses of e-mails which were discarded 
after the users review could be automatically added to the "No 
Admittance List". Additionally, the address of any e-mail added 
to the "In Box" after the user's review could also be 
automatically added to the "Guest List". Addresses of e-mail 
which are discarded would be periodically sent to a filter server 
thereby adding the addresses to a master list. This master list 
15 is then periodically transmitted to all of the users in the 
system through a download server. 

The following glossary of terms define various comments 
described in this application. 

A Mail Server is any service that handles the Simple 
20 Mail Transfer Protocol (SMTP) . Mail Servers are also known as 
Message Transport Systems (MTS) . Examples of Mail Servers are 
Sendmail, Microsoft's Exchange, etc. 

Mail Storage refers to any type of system for storing 
electronic mail (usually stored per user in mailboxes) . Mail 
25 Storage can consist of file storage, a database, etc. 

A Mail Drop Service is any service that allows users 
to directly retrieve messages from their mailboxes. Users 1 e- 
mail clients usually directly interact with a Mail Drop Service 
via some protocol. Examples of protocols used by Mail Drop 
30 Services are the Post Office Protocol (POP) and the Internet 
Message Access Protocol (IMAP) . 

A Mail Reader is any application that can send and 
retrieve e-mail via a Mail Drop Service. Mail Readers are also 
known as User Agents (UA) . Examples of Mail Readers are 
Qualcomm's Eudora, Microsoft Outlook, Netscape Communicator, Elm, 
Mh, etc. 
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Spam is any unwanted e-mail, also known as unsolicited, 
commercial e-mail (UCE) or junk e-mail. 

PRIEF DESCRIPTION OF THE DRAWINGS 

These and other attributes of the present invention 
will become more apparent in light of the following detailed 
description of an illustrative embodiment thereof, as illustrated 
in the accompanied drawings of which: 

FIGURE 1 is a process flow chart and block diagram 
illustrating the present invention; 

FIGURE 2 is a typical example of a graphic user 
interface according to the present invention; 

FIGURE 3 is a block diagram showing various components 
of the present invention; 

FIGURE 4 is a typical control screen illustrating a new 
search on a member database; 

FIGURE 5 is a control screen illustrating the search 
results of a member database; 

FIGURE 6 is a control screen illustrating a new search 
on an address database; 

FIGURE 7 is a control screen illustrating the search 
results of an address database ; 

FIGURE 8 is a block diagram illustrating the 
collaborative filter of the present invention; 

FIGURE 9 is a block diagram illustrating how the 
collaborative filter is updated; 

FIGURE 10 is a block diagram illustrating sexrver side 
e-mail filtering; and 

FIGURE 11 is a block diagram illustrating the 
collaborative filter replication design. 
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DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Turning to Figure 1, the entire system of the present 
invention 10 broadly includes a section 48 associated with a 
user's personal computer and a section 46 provided at a location 
5 remote from the personal computer and connected therewith by a 
standard wired or wireless telecommunications link or any other 
communications methodology. Each of the users who are part of 
the system according to the present invention, would be provided 
with appropriate software allowing each of the users to prepare 

10 individualized dual filters to automatically prevent certain 
unwarranted e-mail from being received as well as to 
automatically receive desired e-mail. The software can be 
installed directly by the user or would be resident within 
popular mail programs, and once installed be transparent to the 

15 user. One of these filters is automatically updated by other 
users in the system when known unwanted e-mail addresses are 
determined. This software would also allow the individual to use 
a customized graphic user interface to assemble the filters. A 
typical graphic user interface will be discussed in more detail 

20 hereinafter. However, it is noted that the exact nature of the 
graphic user interface can vary depending upon its application 
and implementation. 

The software would allow the individual user to 
construct an automatic discard filter 12. The automatic diacard 

25 filter is a collective term consisting of a user modified discard 
filter, a user personal address filter as well as a user personal 
string filter. During operation of the system, the automatic 
discard filter 12 would include a current filter list comprising 
a list of active e-mail addresses against which incoming e-mails 

30 are compared. This current filter list is retained in a memory 
section of the users computer. Any comparison between any 
incoming e-mail and the current filter list could be accomplished 
within the user's computer system. The current filter list is 
maintained at the remote central location 46 as well as being 

35 periodically updated in each of the users PC systems 48. The 
remote location 46 would include a delta filter server 22 and 
download server 24 for a particular user as well as delta server 
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filter 26 from all other users. The current filter list can be. 
modified by the user to personally remove any addresses therefrom 
through various deletion techniques, thereby providing the user 
with a user modified discard filter. The user personal address 
5 filter would include additional addresses the user has added to 
the current filter list as well as any character strings that the 
user has added via a text entry containing an n @" . For* the 
purpose of the present invention, a text entry is a character 
string entered into the system by keyboard typing. Typing is 
10 initiated by double clicking or highlighting and typing, thereby 
clearing an old string and creating a new string. When the mouse 
is clicked on some other location or "enter" is hit, the string 
will be entered into the appropriate memory structure for the new 
field. 

15 The user personal string filter is defined as any 

character string that the user had added to the automatic discard 
filter to create a "No Admittance List" via text entry that does 
not contain the "@" . The term "No Admittance List" would be a 
list of terms and addresses included to create the automatic 

20 discard filter. The "No Admittance List" 52 is included in the 
graphic user interface 50 illustrated in Figure 2. 

The Guest List Filter 14 includes addresses the user 
has personally added to the system, for example by dragging an 
e-mail to the "Guest List" 54 shown in the graphic user interface 

25 of Figure 2, or by any other means. The Guest List Filter 14 
also includes any character strings the user has added via a text 
entry containing an "@" . 

Any e-mail received by the user is checked against the 
automatic discard filter 12 to determine whether any character 

30 string on the "No Admittance List" 52 will bar entry of any 
e-mail with matching text in its address, subject line or message 
body. If this occurs, that e-mail will be eliminated from the 
users system as indicated by the Trash Bin 16. 

Conversely, any address contained in the Guest List 

35 which matches an address of an incoming e-mail would be 
automatically forwarded to the In Box folder 18 for review by the 
user. Similar to the situation with respect to the No Admittance 
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List 52, a text string entered in the Guest List 54 would forward, 
all messages containing that character string to the »m Box" 
folder 18. This feature would allow users to receive on-demand 
direct marketing information from parties, promoting products for 
which the user has expressed interest based upon the text string 
entered in the Guest List 54. 

Incoming e-mail which is not filtered by the automatic 
discard filter 12 based upon the No Admittance List 52 or is 
included in the guest list filter 14 as embodied in the Guest 
List 54 would be automatically sent to a Waiting Room folder 20 
to be individually reviewed by the user. 

Unknown e-mail stopped by the automatic discard filter 
12 based upon the inclusion of an unwarranted character string 
or based upon a personal review by the user would be used to both 
15 automatically update the addresses included in the automatic 
discard filter as well as to alert other users in the system of 
the existence of objectionable e-mail addresses. These new 
addresses are periodically and automatically transmitted to a 
address filter server 22 provided at the remote central location 
20 46. Based upon numerical and temporal factors as described 
hereinafter, these addresses are included on the current filter 
list associated with the address filter server 22 stored in a 
filter database associated with a database server 24 in 
communication with the address server 22. 

Periodically, the database server 24 in communication 
with the address filter server 22 would download updated filter 
addresses to the various users in the system by constructing an 
address packet consisting of every address on the current filter 
list since the date and time of each of the users last update. 
The address packet is a data structure consisting of N strings 
of e-mail addresses and a variable containing the time of 
construction of the packet. The packet is compressed for 
downloading and uploading multiple e-mail addresses. Based upon 
the particular implementation of the software of the present 
35 invention, the updated version of the current filter list is 
substituted for the No Admittance List currently provided in the 
users system. Alternatively, since the No Admittance List might 
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include addresses and character strings personally added by the. 
user but not included in the current filter list, the updated 
filter list would be compared with the automatic discard filter 
and any additional entries not included in the automatic discard 
5 filter would be added thereto. 

Figure 2 illustrates the In Box folder 18 and the 
Waiting Room 20 in more detail as well as giving examples of the 
type of messages included therein. The list of names included 
on the automatic discard filter 12 are provided in the No 

10 Admittance List 20. Any incoming e-mail whose new address 
matches one of the addresses on this list is immediately 
discarded to Trash 16. Addresses may be added to this list via 
an update button 61, the Add to No Admittance button 58, text 
entry, or by dragging a selected e-mail to this window with the 

15 mouse. The update button 62 automatically downloads the latest 
automatic discard filter from the download server 24. The 
updated filter list is displayed in the No Admittance Window. 
Simultaneously, user added e-mail addresses are sent to the Delta 
Filter Server 22 for consideration in future updates to the users 

20 in the system. 

The Waiting Room folder 20 includes only those e-mails 
that have successfully passed through both the automatic discard 
filter but are not included on the Guest List filter 54. 
Additionally, any e-mail from any folder may be selected and 
dragged into the Waiting Room 20 by the user using his or her 
mouse. The Waiting Room display as shown in Figure 2 would 
include e-mail addresses, the date and time of receipt as well 
as the subject of the e-mail. The exact layout of this Waiting 
Room can be changed depending upon the users requirements. 

An Empty button 56 is associated with the Waiting Room 
20. This button discards all e-mails in the Waiting Room folder. 
If the folder is not empty, a pop up box will be displayed with 
a warning ensuring that the user wishes the Waiting Roo^to^beT 
emptied. if this is the case, a pop up button would allow the 
35 user to proceed. If the user does not wish to empty the e-mails 
in the Waiting Room, the initial request would be canceled and 
the e-mails included therein would not be cleared. 
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The Add to No Admittance button 58 associated with the. 
Waiting Room 20 would add the addresses of all selected e-mails 
in the Waiting Room to the automatic discard filter. The No 
Admittance List in the no admittance window will scroll to reveal 
newly added addresses. 

The Guest List window 54 would include a list of names 
on the Guest List filter. Any incoming e-mail whose new address 
matches one of the addresses on this list is immediately 
forwarded to the In Box folder 18. Addresses may be added to 
this list via the add to Guest List button 60, text entry, or by 
dragging a selected e-mail to this window with a mouse. 

The In Box 18 includes only those e-mails that have 
successfully passed through both the automatic discard filter and 
the Guest List Filter. Additionally, any e-mail from any folder, 
may be selected and dragged into the In Box 18 by the user using 
the mouse. Similar to the Waiting Room 20 the In Box 18 includes 
the e-mail addresses, the date and time of receipt as well as the 
subject matter of the e-mail. Furthermore, the particular 
configuration of the In Box 18 as illustrated in Figure 2 can be 
20 changed depending upon the users requirements. By clicking on 
an open slot in the No Admittance List 52 or the Guest List 54 
or by double clicking on a existing text, the user may enter a 
character string to be checked in the filtering system. Any such 
character string on the No Admittance List will bar the entry of 
any e-mail with matching text in its address, subject line or 
message body. For example, as shown in the No Admittance List 
52, any received e-mail with the words "free money" in its 
subject or message would be discarded. A text string similarly 
entered in the Guest List would forward all messages containing 
30 that character string to the In Box. Text entry can also be used 
to type in new e-mail addresses or edit existing ones on either 
of the filter lists. 

The use of the click and drag technology would allow 
a graphic user to be used to transfer the address or character 
35 string of an e-mail to either the Admittance List or the Guest 
List. 
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Figure 3 illustrates a typical block diagram of the. 
major components of the preset invention. The present invention 
can be utilized by a home user 17 or by a corporate user 19 
connected to the Internet 11. The provider of the preset 
invention is also connected to the Internet and Intranet 11 
allowing a web server 13 to advertise the present invention 
through a home page 15. The provider connected to the Internet 
and Intranet 11 at a second location allows the automatic discard 
filters 11 of the users to be particularly updated. A local area 
network (LAN), wide area network (WAN) or any other type of 
network provided at the remote location 46 allows the address 
filter server 22 to be in communication with the database server 
24 as well as a filter database administration tool 25 allowing 
the provider to maintain complete control over the state of the 
15 address filter and the filter database. Figure 3 also 
illustrates the Internet and Intranet 11 connections between a 
typical home user and a corporate user. 

The address filter server 22 would relay e-mail filter 
user requests to the filter database of the database server 24 
and meeting appropriate calls via RPC to a library on the 
database server or by sending SQL commences to the database 
directly. The address filter service will be implemented via 
connection based (TCP) communication. The sending of new 
addresses from an e-mail filter user to the filter database could 
be handled by connectionless (UDP) communication since failure 
to handle all new addresses is not critical. 

Figure 3 illustrates a situation in which the e-mail 
filtering can occur directly at the' home user's PC 17 or a 
corporate user's PC 19 and communication is provided over the 
30 Internet and Intranet 11 to the remote location 46. However this 
invention can be practiced employing a centralized e-mail system 
database 21 connected to the home user's PC 17 or the corporate 
user's PC 19 through the Internet and Intranet 11. m this 
situation the filtering is accomplished at the centralized e-mail 
35 system database which is the location of the e-mail directed to 
the home user or the corporate user. 
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Figure 4-7 illustrate typical control screens used by. 
the administrator of the present system at a remote central 
location. These screens are used for maintaining, searching and 
editing both the address database which consists of every address 
that has been sent to the central location address filter server 
22 or added via a centralized control interface as well as the 
member database consisting of all members who have set up and 
updated their software. Figure 3 illustrates a "blank" address 
database screen and Figure 4 illustrates this database screen 
when information has been entered therein. 

Both the control screen for the member database 62 and 
the address database 70 contain a search field 64, a search panel 
66 and a filter panel 68. The search field 64 would contain 
information matching an entry in either the address database or 
15 the member database. Buttons 72 and 74 would allow either of 
these databases to appear on the control screen. Both of these 
databases would include search results run in either the address 
database or the member database in Section 66. The current 
filter Section 68 would allow entries to be updated or saved at 
20 various times. It would also include a box 76 indicating the 
number of days an address can remain on the current filter list 
without a new instance of that address being uploaded by the 
filter users. It would also include a box 78 listing the minimum 
number of reportings required for an address to be placed on the 
25 current filter list. Certainly both of these central screens can 
be set up in multitude of ways depending upon the specific 
information to be provided. 

Returning to Figure 1, the process of comparing 
received e-mails to both the Automatic Discard filter 12 and the 
30 Guest List filter 14 will now be explained. Incoming e-mails 28, 
30 and 32 are compared to information contained in the user 
modified discard filter, the user personal address filter and the 
users personal string filter utilizing the address line, the 
subject line as well as the message body. Since the information 
35 included in e-mails 28, 30 and 32 are not contained in the 
automatic discard filter, all three of these e-mails are directly 
transmitted to the Guest List filter 14. The e-mail addresses, 
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subject line and message body of these three e-mails result in, 
a match for all three of these e-mails. Consequently, these 
e-mails are sent to the In Box folder 18. 

E-mails 34 and 36 were sent from a known bulk e -mailer. 
Since information in these e-mails are included in the automatic 
discard filter 12, both of these e-mails are directly sent to 
trash 16. 

Four unknown e-mails 38, 40, 42, and 44 are initially 
sent to the automatic discard filter 12. E-mail filter- 38 is 
automatically stopped by the automatic discard filter and sent 
to trash 16. Although the address of this e-mail is not 
initially included in the automatic discard filter 12 the subject 
line or message body contains a character string included in the 
automatic discard filter. The address of this e-mail is 
15 automatically added to the discard filter 12. During the next 
master filter update, this new junk e-mail address will be 
forwarded to the Delta Filter Server 22. E-mail 40 passes 
through the automatic discard filter 12 and is stopped by the 
Guest List filter 14 and is then forwarded to the Waiting Room 
20 folder 20. Upon review, the user decides to place this address 
on the automatic discard filter. Future e-mails from the same 
sender will be sent to trash. During the next master- filter 
update, this new e-mail would be forwarded to the Delta Filter 
Server 22. E-mail 42 passes through the automatic discard filter 
25 12 and is stopped by the Guest List filter 14 and is then 
forwarded to the Waiting Room folder 20. The user reviews this 
e-mail and decides to place it on the Guest List. Future e-mails 
from the same sender will be sent to the In Box folder 18. 

E-mail 44 passes through the automatic discard filter 
and is stopped by the Guest List filter 14 and sent to Waiting 
Room 20. Since the user took no action with respect to this 
e-mail, it would remain in the Waiting Room folder. 

Collaborative Filtering Technology (CFT) is a filtering 
solution for stopping junk e-mail and involving end users in the 
35 war against spam. CFT works by leveraging the actual spamming 
experiences of end users to create a dynamically changing set of 
junk e-mail filter rules. These rule sets are then used to sort 
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spam. The technology is simple, effective and empowers end users 
by involving them as active players in the spam wars. 

Collaborative Filtering Technology is the best e-mail 
filtering solution for organizations with numerous e-mail users 
5 such as Internet service providers (ISPs), free Internet e-mail 
providers, and MIS departments of major corporations. This 
technology integrates well across various platforms and 
infrastructure architectures, providing an extremely high level 
of end user protection with relatively little administrative 
10 burden. Collaborative filtering maximizes benefits while 
minimizing costs. 

When an end user receives a piece of e-mail that he or 
she decides is junk mail, the user submits that message to the 
Collaborative Filter through a simple button click. The e-mail's 
15 body is analyzed and it is stored in the Collaborative Filter. 
When a small but statistically significant number of the same 
message have been submitted, the Collaborative Filter is updated 
to start filtering all such messages from the system. Unlike 
other e-mail filtering systems, the Collaborative Filter does not 
exclusively utilize source filtering. Source filtering uses an 
e-mail's header information to filter e-mail from given source 
addresses. Experience indicates that source filtering is 
inappropriate for completely filtering junk e-mail since e-mail 
headers are easily forged. Instead, the Collaborative Filter of 
the present invention uses message filtering based on an e-mail's 
body. Because the e-mail's body must contain a message (i.e. 
advertisement) and this message cannot be drastically altered, 
the body is therefore the most appropriate data to be used for 
filtering. 

The Collaborative Filter is an implementation of server 
side filtering. Server side filtering is filtering that occurs 
at the Mail Server or Mail Drop Service level . This is a 
different approach from client side filtering, in which filtering 
occurs at the Mail Reader level. Server side filtering is 
appropriate for any organization that manages its own Mail 
Servers and Mail Drop Services because it saves network, storage 
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resources, and end user time by stopping spam from propagating, 
throughout the network. 

The Collaborative Filter is comprised of two major 
components: the Spam Filter and the Submittal Filter. The Spam 
Filter is responsible for filtering incoming e-mail while the 
Submittal Filter filters user submissions to the Spam Filter. 
These two components cooperate to form the Collaborative 
filtering process that consists of the following major- steps as 
shown in Figure 8 . 

All incoming messages are first filtered at the Mail 
Server using the Spam Filter. If the incoming message is found 
on the Spam Filter, it is discarded. All other messages pass 
through to the Mail Drop Service. 

E-mail is then distributed to the end user mailboxes. 
15 When users download their e-mail from the Mail Drop Service, it 
is filtered again via the Spam Filter. Those that match are 
discarded before the users ever see them. All other messages 
pass safely through to the user. It is necessary to filter at 
both the Mail Server and Mail Drop Service level to ensure that 
20 end users are protected by the latest updates to the 
Collaborative Filter. However, the system would still operate 
if no filtering were to be done at the Mail Drop Service level. 

When end users receive a spam message, they need only 
to press a button to submit it to the Collaborative Filter. This 
25 simple action forwards the junk e-mail to a mailbox (e.g. 
sp^m9jpp,net) where it is examined by the Submittal Filter and, 
if appropriate, is used to update the Spam Filter. If the 
message does not already exist in the Spam Filter, then it is 
added. If it is already present, then the time of submission is 
recorded, and the total number of submissions of that spam is 
increased by one. The filter uses this counter to determine 
whether to filter against this message. Only messages received 
within a certain time frame with a counter greater than or equal 
to a predetermined threshold will be used in filtering. This 
35 threshold ensures that a small but statistically significant 
number of users have submitted to the same spam message before 
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it is filtered. This prevents inappropriate filtering due to. 
user errors of improper submissions . 

System administrators use the Filter Administration 
Tool to define a list of e-mail addresses or domains that cannot 
5 be submitted to the Spam Filter. This list is included in the 
database of the Submittal Filter. This prevents end users from 
submitting messages from known valid sources such as system 
administration broadcast messages or mailing lists. 

Figure 9 illustrates an example of the operation of the 
10 Collaborative Filter. In this simple example, there are two 
users (user A and user B) , and one spam message sent to both 
users . The spam message has never been processed before by the 
Collaborative Filter. 

Initially, the spam message enters the system. User 
15 A logs on first and checks his e-mail. User A downloads two 
messages: his non-spam message and a spam message. Note that the 
spam message passes through the filter since it has never been 
seen before. User A notes the spam message and s-ubmits it to the 
Collaborative Filter via a simple button click. The 
Collaborative Filter uses User A's submission to update the 
filter. From this point on, any new incoming message that 
matches the submitted spam e-mail will be discarded. 

User B now logs on and checks her e-mail. User B only 
downloads the non-spam message because the Collaborative Filter 
filters out the spam message intended for User B based on User 
A's collaboration. 

Note that this is a simplified example of the filter's 
operation. There are only two end users and the Collaborative 
filter begins filtering after one submittal. In actuality, the 
number of end users will be far larger and the Collaborative 
Filter will only start filtering a spam message after it has been 
submitted by a significant number of those users. The majority 
of users will be Spam free like User B, while a small percentage 
of users experience spam and protect the larger end user 
35 community (like User A) . 

The Collaborative Filter consists of two major 
components, namely the Spam Filter and the Submittal Filter. 
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The Spam Filter filters incoming e-mail, and the. 
Submittal Filter filters end user junk e-mail submissions to the 
Spam Filter. End user submissions update the Spam Filter while 
system administrators decide what entries reside on the Submittal 
Filter. System administrators can add, remove, and update 
entries on both the Spam Filter and Submittal filter via a Filter 
Administration Tool. 

The Collaborative Filter is a server side filtering 
solution as shown in Figure 10. Service side filtering occurs 
at the mail Server or Mail Drop Service level. This is a 
different approach from client side filtering, which occurs at 
the Mail Reader level. Server side filtering is appropriate for 
any organization that manages its own Mail Servers and Mail Drop 
Services because it saves network and storage resources by 
stopping spam from propagating throughout the corporate network. 
Another advantage is that end users are not forced to waste time 
downloading, reading, and deleting junk e-mail. Note that client 
side filtering is an appropriate filtering technology when end 
users are not protected from junk e-mail by server side filtering 
products. Figure 10 also illustrates a typical server side 
filtering scenario. 

It is noted that filtering at the Mail Drop Seirvice 
ensures that the most recent updates to the Spam Filter are used 
to eliminate junk e-mail that has already been sent to a user's 
mailbox, but has not yet been downloaded. This allows the 
Collaborative Filter to eliminate spam to most users on its first 
mailing. 

The Collaborative Filter is implemented via a set of 
C libraries. The Collaborative Filter has been implemented in 
C due to the following requirements: 

1. The Collaborative Filter must be efficient. 

2. The Collaborative Filter libraries must be cross- 
platform. 

3. The libraries must allow for easy integration 
with customer software. 

A C implementation achieves these goals because: 
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1. C is a highly efficient development language- 
because it is compiled. Its long history has 
allowed for the development of highly optimized 
compilers. 

2. The C language is highly portable, since most 
hardware vendors provide C compilers for their 
products. In addition, the ANSI C standard 
defines a standard library, which can be reliably 
used to write portable code. 

3. C is an industry-wide development language, used 
by millions of developers around the world. 
Therefore, it is extremely likely that any 
customer can integrate the Collaborative Filter 
libraries into his/her software. In addition, 
most popular development languages allow for 
calling C directly (e.g., C++, Java, Perl, Visual 
Basics, etc) . 

The Spam Filter Library contains the code needed to 
filter e-mail. This library contains the spamCheckf) routine 
20 which is used to check if a given e-mail is junk. The caller 
passes a structure to spamCheckf) which contains pointers to the 
headers and body of the message to be checked. Therefore, if a 
message's headers and body are already in memory (e.g., when a 
Mail Server is processing a SMTP connection), then no memory 
copying will be needed and spamCheckf) can efficiently check 
whether a message is junk. 

The spamCheckf) function checks whether a given e-mail 
is junk in the following fashion: 

1. spamCheckf) generates a signature for the 
message. 

2. spamCheckf ) queries the Spam Database for the 
message's signature. 

3. If the database query does not find the message's 
signature, then the e-mail is not junk and it an 

35 be passed on to end users. 

4. If the database query does find the message's 
signature, then a matching function is used to 
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determine whether or not the message in questions 
truly matches a message in the Spam Filter 
database . 

a. If the matching function does not find 
5 these messages to be equivalent, then 

the message is not junk. 

b. If the matching function does find 
these messages to be equivalent, then 
the message is filtered. 

10 Source filtering is based on a message's sender 

information. Most source based filtering techniques use the 
"From" address from the message's header. Source based filtering 
is not appropriate for completely filtering junk e-mail since 
headers are easily forged. To overcome this limitation, the 
15 Collaborative Filter primarily uses the message body for 
filtering. This is known as message based filtering. 

The signature described in the above process is a hash 
function based on the message's body. Message signatures are 
very important because they allow the Collaborative Filter to 
operate in an efficient manner. The message signature enables 
non-junk e-mail messages to quickly pass through the filter. 
This occurs because it is extremely unlikely that an incoming 
non-junk message signature will match the signature of a junk e- 
mail already stored in the database. 
25 Since the message signature is a type of hashing 

function, there will be some unavoidable signature collisions 
(i.e., two unique messages which generate the same signature). 
The filtering algorithm resolves signature collisions by 
calculating a matching function on both messages to ascertain if 
these messages are really equal. The matching function uses a 
combination of techniques (e.g., checksum, fuzzy matching) to 
generate a likelihood that two messages are essentially 
equivalent. Exact comparisons cannot be used since junk e-mail 
senders will embed extra characters in their outgoing e-mails to 
circumvent message based filtering techniques. For example, 
spammers may add extra characters at the beginning of a message 
by including personalized salutations. A fuzzy matching function 
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is the appropriate solution to this problem because a spammer, 
cannot change that portion of an e-mail's body that contains his 
or her message (e.g., advertisement). 

The Submittal Library contains the code needed to 
handle user submissions to the Collaborative Filter. The library 
contains the spamSubmit ( ) routine which is used to submit a 
user's junk e-mail message to the Collaborative Filter. 

End users will have a mechanism for forwarding a piece 
of junk e-mail to the Collaborative Filter. This mechanism will 
forward the junk e-mail message to a defined mailbox for handling 
junk e-mail (e.g. spam@isp.net). The Submittal Library is then 
used to process these incoming junk e-mail submissions. 

The main job for the Submittal Library is to filter 
incoming submittals to ensure that valid messages are not 
15 included in the spam filter. Examples of such messages are 
system administration broadcast messages and mailing lists. 

The spamSubmit () routine submits spam to the Spam 
Filter in the following fashion: 

1. The forwarded junk e-mail is parsed from the 
20 submission e-mail sent by an end user. 

2. The Submittal Filter is checked to determine 
whether this submitted junk e-mail should be 
added to the Spam Filter. 

3 . When a submitted junk e-mail is added to the Spam 
25 Filter, the Spam Filter is updated by 

spamSubmit () . 

Note that the Submittal Filter is not updated via the 
spamSubmit () routine. The Submittal Filter is updated by the 
Filter Administration tool that uses direct SQL commands to 
30 add/remove/update entries on the Submittal Filter. 

The Spam and Submittal Filters are both stored in 
relational databases. Relational database technology was chosen 
due to its mature nature and ability to handle numerous 
transactions. These are extremely important considerations since 
the Collaborative Filter will filter all incoming e-mail - 

The Collaborative Filter only interacts with relational 
databases via SQL commands. This allows the Collaborative Filter 
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to be integrated with any SQL compliant database. Leveraging, 
existing database technology allows the execution of a filtering 
product that is more cost effective, efficient and reliable. In 
addition, since the filter back-end is implemented via a 
5 database, customers and other third parties can access the 
filter's data for their own specialized needs. 

By using off the shelf relational database products, 
the Collaborative Filter can utilize those products 1 replication 
technology to scale the Collaborative Filter across the 

10 enterprise as shown in Figure 11. The Collaborative Filter's 
replication design is based on one master database and numerous 
read-only replicated sites. This simple replication strategy 
contains no update conflicts and is available from various 
database vendors (e.g., Sybase's Replication Server, Oracle 

15 Snapshots) . 

Under this scheme, network bandwidth allocation is 
flexible, since the schedule for database replication can be 
modified to meet other operational needs. Administrators have 
control over how much network bandwidth they are willing to spend 

20 for more effective junk e-mail filtering (i.e., the more 
synchronized the databases, the more effective the filter) . 

The Filter Administration tool allows system 
administrators to administrate the Spam Filter and Submittal 
Filter. This tool is the main mechanism for adding, removing and 

25 updating entries on the Submittal Filter. 

The Filter Administration Tool consists of a web based 
front end making Hyper Text Transfer Protocol (HTTP) requests to 
a Java Servlet interacting with the Spam Filter and Submittal 
Filter databases with Java Database Connectivity (JDBC) . This 

30 design has the following advantages. 

1. Administration can use any web browser to 
administrate the Collaborative Filter. 

2. This Java solution allows the Filter 
Administration Tool to be both platform and 

35 browser independent. 
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3. The performance requirements of the application, 
are well within the performance parameters of 
Java. 

The Collaborative Filter is designed to be tightly 
5 integrated with a customer's existing Mail Servers, Mail Drop 
Services, and relational databases. Due to the range of customer 
requirements, one should anticipate that some amount of custom 
coding would likely be required to integrate the Collaborative 
Filter into a customer's operations. . 
10 A Mail Server and Mail Drop Service Integration is the 

most involved step in integrating the Collaborative Filter since 
source code modifications must be made to a customer's Mall 
Server and Mail Drop Service. The Mail Server and Mail Drop 
Service must be modified to call the spamCheck() routine at 
15 appropriate places. 

However, even if customers do not have access to their 
Mail Server or Mail Drop Service source code, the Collaborative 
Filter can still be integrated in a proxy application. Fox- 
example, a proxy SMTP application can be built which sits on top 
of the Mail Server and makes calls to the Spam Filter library. 
E-mail that passes the filter in the proxy application is then 
forwarded to the Mail Server by the proxy. Note that the same 
can be done for P0P3 and IMAP4 Mail Drop Services. 

A relational database system must be allocated to hold 
25 the Collaborative Filter. The system must be able to handle the 
extra bandwidth generated by queries form the Spam Filter 
library. For large organizations, it is recommended to use a 
dedicated database system for the Collaborative Filter. 
Customers should note that the Collaborative Filter has an 
30 unusual query load compared to most on-line transaction 
processing systems, since over 90% of its requests will be pure 
queries (i.e., there will be very few inserts, updates or 
deletes) . Due to this unique query load, a dedicated database 
system that can be optimized for filtering is recommended. 

Once a database system has been allocated, installation 
consists of running several SQL scripts to install the 
Collaborative Filter. When the database has been installed, the 
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customer will have to perform standard database administration 
tasks (e.g., backup) . 

Once the Spam Filter and Submittal Filter databases 
have been set up, the Filter Administration Tool's Java Servlet 
must be installed on a Web server that has access to these 
databases. System administrators will then connect to this Web 
server to administrate the Collaborative Filter via their Web 
browser . 

The present invention has been explained with respect 
to specific arrangements and methods. However, it is noted that 
these arrangements and methods are merely illustrative of the 
principles of the present invention. Numerous modifications in 
form and detail may be made by those of ordinary skill in the art 
without departing from the scope of the present invention. 
Although this invention has been shown in relation to a 
particular preferred embodiment, it should not be considered to 
be so limited. 
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WHAT IS CLAIMED IS: 

1. A system for filtering electronic mail received over 
a communication medium to a user's computer comprising: 

a first filter provided in the user's computer 
provided with first a list of addresses from which the user does 
not wish to receive electronic mail; 

a second filter provided in the user's computer 
provided with a second list of addresses from which the user does 
wish to receive electronic mail; 

comparison device provided in the user's computer 
for comparing the addresses included in said first filter and 
said second filter with the address of electronic mail received 
by said user's computer, wherein any electronic mail whose 
address is discovered by said comparison device to be included 
in said first list of addresses will be automatically deleted 
from the user's computer, and further wherein any electronic ma.il 
whose address is discovered by said comparison device to be 
included in said second list of addresses would be retained in 
the user's computer for review by the user. 

2 . The system in accordance with claim 1 , further 
including a first folder for retaining electronic mail received 
in the computer whose address is not included in said first list 
of addresses but is included in said second list of addresses. 

3. The system in accordance with claim 2, further 
including a second folder for retaining electronic mail received 
in the computer whose address is not included in either said 
first list of addresses or said second list of addresses for 
review by the user at a later time, and further including an 
update device for adding addresses to said first filter or said 
second filter based upon the user's review, thereby creating a 
first updated list of first addresses and a first updated list 
of second addresses. 
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4. The system in accordance with claim 1, further, 
including a device for periodically sending said first updated 
list of first addresses to a master database situated at a 
location remote from the user's computer. 

5 

5. The system in accordance with claim 4, further 
including a device for periodically receiving a list of addresses 
from the master database to be added to said first filter* to 
create a second updated list of first addresses. 

10 

6. A system for filtering electronic mail received over 
a communication line to a user's computer comprising: 

a first filter provided in the user's computer 
provided with a first list of addresses from which the user does 

15 not wish to receive electronic mail as well as a first list of 
objectionable character strings; 

a second filter provided in the user's computer 
provided with a second list of addresses from which the user does 
wish to receive electronic mail as well as a second list of 

20 acceptable character strings; 

comparison device provided in the user's computer 
for comparing the addresses and character strings included in 
said first filter and said second filter with the address of 
electronic mail as well as character strings provided in the 

25 electronic mail received by said user's computer, wherein any 
electronic mail whose address is discovered by said comparison 
device to be included in said first list of addresses or 
electronic mail containing a character string included in the 
first list of objectionable characters string, will be 

30 automatically deleted form the user's computer, and further 
wherein any electronic mail whose address is discovered by said 
comparison device to be included on said second list of addresses 
or said second list of acceptable character string, would be 
retained in the user's computer for review by the user. 

35 

7. The system in accordance with claim 6, further 
including a first folder for retaining electronic mail received 
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in the computer whose address is not included in said first list 
of addresses, said first list of objectionable character strings, 
in said second list of addresses or said second list of 
acceptable character strings. 

5 

8. The system in accordance with claim 7, further 
including a second folder for retaining electronic mail received 
in the computer whose address is not included in said first list 
of addresses, said first list of objectionable character strings, 

0 said second list of addresses or said second list of acceptable 
character strings, for review by the user at a later time, and 
further including an update device for adding addresses to said 
first filter or said second filter based upon the user's review, 
creating first updated first list of addresses and an updated 

5 second list of addresses. 

9. The system in accordance with claim 8, further 
including a device for periodically sending said first updated 
list of addresses to a master database situated at a Location 

0 remote from the user's computer. 

10. The system in accordance with claim 9, further 
including a device for periodically receiving a list of addresses 
from the master database to be added to said first filter to 

5 create a second updated list of first addresses. 



11. A system for filtering electronic mail transmitted 
on a communication medium comprising: 

one or moire user computers connected to the 

30 communications line, each of said computers including a first 
filter provided in each of said computers provided with a first 
list of addresses from which a user does not wish to receive 
electronic mail, a second filter provided in said computers 
provided with a second list of addresses from which the user does 

35 wish to receive electronic mail, a comparison device provided in 
each of the computers for comprising the addresses included in 
said first filter and said second filter with the address of 
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electronic mail received by said computer, a first folder for 
retaining electronic mail received in said computer whose address 
is not included in said first list of addresses but is included 
in said second list of addresses, a second folder for retaining 
electronic mail received in said computer whose address is not 
included in neither said first list of addresses or said second 
list of addresses for review by the user at a later time, and 
further including an updated device for adding addresses to said 
first filter based upon the user's review, thereby creating a 
first updated first list of addresses; and 

master server and master database connected to 
said one or more computers by the communication medium for 
periodically receiving said first updated first list of addresses 
to be inserted into said master database and for periodically 
transmitting to each of said computers a list of addresses to be 
added to said first filter to create second updated list of first 
addresses . 

12. A method for filtering electronic mail received 
over a communications medium at a user's computer comprising: 

creating first list of addresses from which the 
user does not wish to receive electronic mail; 

inputting said first list of addresses into the 
user's computer as a first filter; 
25 creating a second list of addresses from which the 

user does wish to receive electronic mail 

inputting said second list of addresses into the 
user's computer as a second filter; 

comparing the addresses included in said first and 
30 second list of addresses with the address of a received 
electronic mail; 

automatically deleting the electronic mail form 
the user's computer if the address of the electronic mail is 
included in said first list of addresses; and 

retaining the electronic mail in a first location 
in the computer if the address of the electronic mail is in said 
second list of addresses or retaining the electronic mail in a 
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second location in the computer if the address of the electronic, 
mail is not included in either said first list of addresses or 
said second list of addresses. 

5 13. The method in accordance with claim 12, further 

comprising the steps of: 

reviewing any electronic mail included in said 
second location; 

eliminating any unwanted electronic mail based 
10 upon said reviewing step; 

recording the electronic address of any unwanted 
electronic mail in said first filter to create a first updated 
list of first addresses; and 

periodically transmitting said first updated list 
15 of first addresses to a master database situated at a remote 
location to create an updated master database. 

14. The method in accordance with claim 13, further 
including the step of periodically transmitting said updated 
master database to said one or more user computers to produce a 
second updated list of first addresses to be included into said 
first filter. 
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15. The method in accordance with claim 12, wherein 
said first filter includes a list of objectionable character 
strings compared to character strings included in a received 
electronic mail. 

16. A system for filtering electronic mail received 
over a communication medium to a user's computer comprising: 

a first filter provided in the user's computer 
provided with first a list of objectionable character strings 
from which the user does not wish to receive electronic mail if 
at least one of said objectionable character strings is included 
35 in the electronic mail; 

a second filter provided in the user's computer 
provided with a second list of acceptable character strings from 
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which the user does wish to receive electronic mail if at least, 
one of said acceptable character strings is included in the 
electronic mail; 

comparison device provided in the user's computer 
for comparing the character strings included in said first filter 
and said second filter with the subject line and body of 
electronic mail received by said user's computer, wherein any 
electronic mail which includes a character string discovered by 
said comparison device to be included in said first list of 
objectionable character strings will be automatically deleted 
from the user's computer, and further wherein any electronic mail 
which includes a character string discovered by said comparison 
device to be included in said second list of character strings 
would be retained in the user's computer for review by the user. 

17. The system in accordance with claim 16, further 
including a first folder for retaining electronic mail received 
in the computer whose subject line and body does not include a 
character string proceed in said first list of character strings 
but is included in said second list of character strings. 

18. The system in accordance with claim 17, further 
including a second folder for retaining electronic mail received 
in the computer not included in character strings provided in 
either said first or second lists for review by the user at a 
later time, and further including a update device for adding 
character strings to said first filter or said update device for 
adding character strings to said first filter or said second 
filter based upon the user's review, thereby creating a first 
updated first list of objectionable character strings and an 
updated second list of acceptable character strings. 



19. The system in accordance with claim 16, further 
including a device for periodically sending said first updated 
35 first list of objectionable character strings to a master 
database situated at a location remote from the user's computer. 
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20. The system in accordance with claim 19, further 
including a device for periodically receiving a list of objec- 
tionable character strings, 

21. A system for filtering electronic mail recorded 
over a communication medium by a user's computer comprising: 

a first filter located at a central location 
provided with first a list of addresses from which the user does 
not wish to receive electronic mail; 

a second filter located at the central location 
provided with a second list of addresses from which the user does 
wish to receive electronic mail; 

comparison device located at the central location 
for comparing the addresses included in said first filter and 
15 said second filter with the address of electronic mail received 
at the central location and directed to the user, wherein any 
electronic mail whose address is discovered by said comparison 
device to be included in said first list of addresses will be 
automatically deleted from central location, and further wherein 
any electronic mail whose address is discovered by said compari- 
son device to be included in said second list of addresses would 
be retained at the central location for review by the user. 

22. The system in accordance with claim 21, further 
25 including a first folder for retaining electronic mail received 

at the central location whose address is not included in said 
first list of addresses but is included in said second list of 
addresses. 



20 



30 



35 



23. The system in accordance with claim 22, further 
including a second folder for retaining electronic mail received 
at the central location whose address is not included in either 
said first list of addresses or said second list of addresses for 
review by the user at a later time, and further including an 
update device for adding addresses to said first filter or said 
second filter based upon the user's review, thereby creating a 
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first updated list of first addresses and a first updated list, 
of second addresses. 

24. The system in accordance with claim 21, further 
5 including a device for periodically sending said first updated 
list of first addresses to a master database situated at a 
location remote from the user's computer and the central 
location. 

10 25 • Tne system in accordance with claim 24, further 

including a device for periodically receiving a list of addresses 
from the master database to be added to said first filter to 
create a second updated list of first addresses. 

15 26. A system for filtering electronic mail received 

over a communication medium by a user's computer comprising: 

a first filter located at a central location 
provided with a first list of addresses from which the user does 
not wish to receive electronic mail as well as a first list of 
20 objectionable character strings; 

a second filter located at the central location 
provided with a second list of addresses from which the user does 
wish to receive electronic mail as well as a second list of 
acceptable character strings; 

comparison device located at the central location 
for comparing the addresses and character strings included in 
said first filter and said second filter with the address of 
electronic mail as well as character strings provided in the 
electronic mail received at the central location, wherein any 
electronic mail whose address is discovered by said comparison 
device to be included in said first list of addresses or 
electronic mail containing a character string included in the 
first list of objectionable character string, will be 
automatically deleted from the central location, and further 
wherein any electronic mail whose address is discovered by said 
comparison device to be included on said second list of addresses 
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or said second list of acceptable character strings, would be. 
retained at the central location for review by the user. 

27. The system in accordance with claim 26, further 
including a first folder for retaining electronic mail received 
at the central location whose address is not included in said 
first list of addresses, said first list of objectionable 
character strings, said second list of addresses or said second 
list of acceptable character strings. 

28. The system in accordance with claim 27, further 
including a second folder for retaining electronic mail received 
at the central locatior whose address is not included in said 
first list of addresses, said first list of objectionable 
character strings, said second list of addresses or said second 
list of acceptable character strings, for review by the user at 
a later time, and further including an update device for adding 
addresses to said first filter or said second filter based upon 
the user's review, thereby creating a first updated list of 
addresses and a first updated list of second addresses. 



29. The system in accordance with claim 28, further 
including a device for periodically sending said first updated 
list of addresses to a master database situated at a location 

25 remote from the user's computer. 

30. The system in accordance with claim 29, further 
including a device for periodically receiving a list of addresses 
from the master database to be added to said first filter to 
create a second updated list of first addresses. 

31. A method for filtering electronic mail received 
over a communications medium be reviewed by a user's computer 
comprising : 

35 creating a first list of addresses to a central 

location connected to the communications medium thereby creating 
a first filter; 
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creating a second list of addresses from which the, 
user does wish to receive electronic mail; 

introducing said second list of addresses to the 
central location connected to the communication medium, thereby 
creating a second filter; 

comparing the addresses included in said first and 
second list of addresses with the address of an electronic mail 
received in the central location; 

automatically deleting the electronic mail from 
the central location if the address of the electronic mall is 
included in said first list of addresses; and 

retaining the electronic mail in a first location 
in the central location if the address of the electronic mall is 
in said second list of addresses or retaining the electronic mail 
in a second location in the central location if the address of 
the electronic mail is not included in either said first list of 
addresses or said second list of addresses. 

32. The method in accordance with claim 13, further 
comprising the steps of: 

reviewing any electronic mail included in said 
second location; 

eliminating any unwanted electronic mail fc>ased 
upon said reviewing step; 

recording the electronic address of any unwanted 
electronic mail in said first filter to create a first updated 
list of first addresses; and 

periodically transmitting said first updated list 
of first addresses to a master database situated at a remote 
location to create an updated master database. 

33. The method in accordance with claim 32, further 
including the step of periodically transmitting said updated 
master database to the central location to produce a second 
updated list of first addresses to be included into said first 
filter. 
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34. The method in accordance with claim 32, wherein, 
said first filter includes a list of objectionable character 
strings compared to character strings included in a received 
electronic mail. 



35. A method for mail server side filtering electronic 
mail received over a communication medium comprising the steps 
of : 

providing a first filter at the mail server 
including a list of spam messages which should not be sent to an 
end user; 

receiving a first electronic message ultimately 
intended for one or more end users at the mail server; 

comparing at the said mail server said received 
first electronic message to said spam messages provided in said 
first filter, said received first electronic message discarded 
if said received first electronic message is included as a spam 
message in said first filter and transmitting said first 
electronic message to the end user if said received first 
electronic message is not included as a spam message in said 
first filter; 

providing a second filter at the mail server for 
the receipt of a second electronic message sent over the 
communications medium from an end user, said second electronic 
message received at said second filter considered to be a spam 
message by the end user; and 

adding said second electronic message to said 
first filter as a spam message, if said second filter determines 
that said second electronic message is a spam message. 

36. The method in accordance with claim 35, further 
including the steps of: 

providing a counter associated with said second 

filter; 

counting the number of repeated second electronic 
messages received by said second filter; and 
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adding said second electronic message to said, 
first filter as a spam message if said counter exceeds a 
predetermined value. 

37. A system for filtering electronic ma.il received 
over a communication medium to an end user's computer, compris- 
ing: 

a mail server for receiving first electronic 
messages intended to be received by the end user, said mail 
server provided at a location remote from the end user; 

a first filter located at said mail server, said 
filter including a list of spam messages which should not be sent 
to the end user, said first filter also including a device for 
comparing at least a portion of the body of said first electronic 
15 messages with each of said spam messages; 

a second filter located at said mail server for 
receiving second electronic messages transmitted over the 
communication medium to said mail server from the end users; 

a counter and comparison device located at said 
mail server and in communication with said second filter for 
counting and classifying the number of said second electronic 
messages received by said second filter, said counting and 
comparison device determining the number of similar second 
electronic messages received by said second filter; 
25 wherein when the number of similar second 

electronic messages received by said second filter exceeds a 
predetermined number, said similar second electronic message is 
added to said first filter as an additional spam message. 



20 



WO 99/32985 



1/11 



PCT/US98/26233 




SUBSTITUTE SHEET (RULE 26) 



WO 99/32985 



PCT/US98/26233 



2/11 



s> 



e 

o 
o 

'6 
E 



E 
o 

E ~ 5 
•2, 2 @ 

2 HJ 

X ® O 

@ .2 «° 

is ? 



Sot 

I . I 



521 




to 



O 

a> 

x 

3 



a> 

.O O £ 

E "S 

O <D o 

o o; ii 



o 



00 I*** CM CN 



2 

Q_ 
CN 
CN 

cb 



S 2 25 

CL CL a. 

«- CM 

^ t~ o 

to di 6i 



CL 

6> 



-2 

Q) 

E 



a | g 

S -a @ 

O O jQ 



c 
o 



o a: 



O 
CO 

2 



5 

a> 
o 
o 



3 



< 



> 



3 

o 



g OT L Q= "S 

# £ ° ^ o 

.b= 3 O O 

<c S op ro z 



c 
o 

CD 



E 
o 
o 



g.T 



1= /a. 



3 



E 

o 

0 o 
5. «• » 

1 r u 

* II 

Q> CO 

0> 2> @ 

0 ^ co 
J2 * 

1 c 

CO 

^1 



81 



mi 



i 

Q 
Q. 
3 



O 
< 



O 
O 

or 



0O CD CN 0O CO CO 



a> 



< < 1 ^ ^ s 

o m i— $ ^ 

lO ro K> CD i-I 

- • » • »♦ 

^ M n ^ in 



o 

CL 



3 
o 



to 

CD 



@ CD 

O @ 

1 1 

c c: 



3. E , 

@ 2 -£ 

E «> o 

8- I 1 

to :s je 



CO 



c 
o 
o 

o 

E 



u 
c 
o 



E^ 

3 



3 



s 

CO 
ID 



CL 



(O 

m 



CN 
O 



SUBSTITUTE SHEET (RULE 26) 



WO 99/32985 



POVUS98/26233 



3/11 




HOME USER'S PC 




ZJM EMAIL ^ 






FILTER ADD- 






l ON 






, it 




l 






EMAIL CLIENT 






< j 





CENTRALIZED , 
EMAIL SYSTEM 
PROXY/MAIL 
SERVER 



» . _ 



CENTRAUZEO 
EMAIL SYSTEM 
DATABASE 
SERVER 



CORPORATE 
GATEWAY/PROXY^ 
SERVER 



CORPORATE 
EMAIL SERVER 



CENTRALIZED 
EMAIL SYSTEM LAN 



CORP ORATE USER'S PC 

J ZJM EMAIL * 

=v FILTER ADD- 

LlON 



EMAIL CLIENT 



, ^PORATE CUENT LAN_ 



19 
\ 



WEB BROWSER 
OR 

EMAIL ACCESS 
. APPLICATION t 



CENTRAUZED EMAIL 
SYSTEM 
USER'S PC 



LEGENO 
LAN 



! COMPUTER 



_____ j 



SOFTWARE PROGRAM, 
LIBRARY.OR SERVICE 



FIG.3 



SUBSTITUTE SHEET (RULE 26) 



WO 99/32985 



PCT/US98/26233 



4/11 



a 



CO 



4> 



CO 



1 




M — 

o 
£ 

LJ 



(0 



s n y 



o 



C 
3 
O 

o 
o 
< 

> 

< 

o 



o 
hi 



(0 

(0 Q) 

^- o 

< Z> 



E q 



O 

c a> 
o o 

u O 
Q_ 



CD 

2 o 
00 CL 



o 
o 



D 
Q_ 



(0 
3 



E 
o 



to 
o 



<D 

E 
o 



CO 
CO 

0) <— » 



a> 
o 

Q 



u_ C — \ 

-D >x 3 

■q ^ o © 

< O O (/) 



to 

1 




n> 



to 
a> 

Q£ 




iZ 



lilt 



J 



<D 



CD 



SUBSTITUTE SHEET (RULE 26) 



WO 99/32985 



PCT/US98/26233 



5/11 



CO 



2 



CM 

to 



E 
a> 



M 

© 



c 

o 
o 
o 

< 

a> 
# > 

a 
< 

© 



s 

m 

CN 

oo 
o 

CM 

o 

a> 
O 
o. 



to 
o 

CM 



CO 

Z> 



-O 

E 



o 

CO 



CL 



O 

E 
Ul 

B 



a> 
> 
o 



c 

o o 

0) o 



J) 



CO 
0) 



O 
C 

*> 

o 







Iville 




<* 








Z> 






o 






E 




E 






Na 


ess 












c 



-O 

.o 

"v_ 
O 
CO 

CO 
CO 
CD 

XI 

5 



O 

E 
ui 



Q) 
O 



0) ^ 

-S O 
CO CL 



O .b T3 S O *" 

— I u_ < o o 



CO 

1 




LO 



SUBSTITUTE SHEET (RULE 26) 



WO 99/32985 



PCT/US98/26233 



6/11 



SI 



1 



CO 



£ 
o 

0) 
< 



(O 

to 
(D 

< 
> 

3 



0) 
CO 

-a 

< 

0) 

# > 

o 
o 
c 



c c 



c 

D 

E 

(D 
CL 



C 

o 
E 

a> 
Q_ 



o 
i_ 

13 
O 
CO 



o o 



o 
o 
o 
o 
o 
o 
o 
o 
o 
o 



c 

3 

O 

o 



c 
a> 

CO 

a> 




c 

O 
O 

o 



CO 



c 

.2 
*co 

CO 

£ 

-Q 
3 

to 



(0 
D 



iSi 

iE 



1 



in 
0) 



a. x _j 




CD 
CD 



OO 
CO 



SUBSTITUTE SHEET (RULE 26) 



WO 99/32985 



PCT/US98/26233 



7/11 



1 



CO 
CO 



o 



HI 



O 

ft 

X 
ID 
V) 

<§> 

c 



0> 

E 
o 



CO 
CO 

"D 

< 



(0 
0) 

-o 

> 

O 
< 



c 

<D 
C 

o 
E 

i_ 

CL 



CO 
CO 
<D 
t_ 

TJ 
"D 
< 

CD 

o 

D 
C 



c 

CD 

c 

D 

E 

it- 
CD 
CL 



o o 



a> 






to 










o> 


o 






o 






o 




o 


o 




o 


o 




o 


o 




o 


o 




o 












C 


C 


D 


3 



aj 
O 



a 

13 
O 
C/) 



o 



c 
a> 

CO 

a> 



o 
o 

o 

o 

CO 



a> 



cn 

> 
o 

I 



c 



o* 




<l i i> 



CO 

CL X 3 



CO 
CD 



8 

O 
<S 

CO 
CO 

£ 

5 



(•h O 





csj 



o 



S CD 



CP S 



— O O 



SUBSTITUTE SHEET (RULE 26) 



WO 99/32985 



8/11 



INCOMING 
EMAIL 



FILTER ADMINISTRATION 
TOOL 



COLLABORATIVE FILTER 



SUBMITTAL FILTER 
© 



SUBMITTAL 
ADDITIONS 




\ POTENTIAL 
" FILTER 
ENTRIES 



SPAM RLTER AT 
MAIL SERVER 



SPAM FILTER AT 
MAIL DROP SERVICE 



SUBMITTED 
REJECTIONS 



DELIVERED 
EMAIL 




(^SUBMITTED 
^ SPAM 




FIG.8 



SUBSTITUTE SHEET (RULE 26) 



WO 99/32985 



PCT/US98/26233 



9/11 



MSG FOR 
USER A 



MSG FOR 
USER B 




SPAM MSG TO 
A AND B (1) 



COLLABORATIVE FILTER 



SPAM MSG (4) 




FIG.9 



SUBSTITUTE SHEET (RULE 26) 



WO 99/32985 



PCT/US98/26233 



10/11 



INCOMING 
E-MAIL 



3± 



MAIL SERVER 

I.E. SENDMAIL 
EXCHANGE 
ETC. 



IPC OR RPC CALLS 



FILTER E-„., 
(YES OR NO 



r 
IL? 



E-MAIL 
-DELIVERED 



MAIL STORAGE 

l£ USER 
MAILBOXES, 
FILE STORAGE, 
DATABASE, ETC. 



IPC OR RPC 
CALLS 



ZJM SOFTWARE 
ON INTERNET SERVICE PROVII 



SITE 



COLLABORATIVE 
FILTER 



FILE.NETWORK 
OR SQL 
COMMAND 



MAIL DROP 
SERVICE 

LE. P0P3 OR 
IMAP4 SERVER 





-ROWS 



FILTER E-MAIL? 
(YES OR NO) 



FILTER 
ADMINISTRATION 
TOOL 



END USER COMPUTER 



E-MAIL RETRIEVED 



INTERNET SOFTWARE 



MAIL DROP SERVICE COMMAND 



INTERNET SERVICE PROVIDER 



MAIL READER 



WEB BROWSER 

LE. INTERNET 
EXPLORER 



FIG. 10 



SUBSTITUTE SHEET (RULE 26) 



WO 99/32985 



PCT/US98/26233 



11/11 




FIG. 11 



SUBSTITUTE SHEET (RULE 26) 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US98/26233 



A. CLASSIFICATION OF SUBJECT MATTER 
IPC(6) :G06F 13/38, 15/17 
US CL :395/206 

According to International patent Clarification (IPC) or to both national classification and IPC 



B» FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 
U.S. : 395/206,207,217,300 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 
APS 

search terms: electronic mail, e-mail, electronic messaging, filters, sort, junk e-mail 



C DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 4 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



A 

X, P 



US 5,619,648 A (CANALE et al.) 08 April 1997, Abstract, Fig. 1- 
4, Col. 1, lines 10-67, Col. 2, lines 6-48, CoL 3, lines 12-67, Col, 
4, lines 1-33, Col. 6, lines 66-67. 

US 5,377,354 A (SCANNELL et aL) 27 December 1994, Abstract, 
Fig. 1-2 

US 5,283,856 A (GROSS et al.) 01 February 1994, Abstract 

US 5,764,899 A (EGGLESTON et aL) 09 June 1998, CoL 2, lines 
66-67, CoL 3, lines 1-15, CoL 8, lines 15-59, CoL 9, lines 62-67, 
CoL 10, lines 1-11 



1-37 

1 
3 

1-31 



[x| Further documents are listed in the continuation of Box C. f~] See patent family annex. 



Special categories Of cited 

document defining the genera] state of the an which ii not 
to be of pertkukr rekvanoe 



document published on or after the mtenmtional filing date 

document which mty throw doubte on priority eUim(t) or which b 
cited to establish the publication dale or another citation or other 
special reason (at •pacified) 

document referring to an oral disclosure, um, exhibition or other 



document pubtuhed prior to the international filing date but later than 
the priority date claimed 



T later document published after the international filing date or priority 

date and not in conflict with the application but cited to understand 
the principle or theory underlying tho invention 

*X- document of particular relevance; the claimod invention cannot be 

considered novel or cannot be considered to involve an inventive step 
when the document is taken alone 

■Y" document of particular relevance; the claimod invention cannot be 

considered to involve an inventive step when the document is 
combined with one or more other such documents, such combination 
being obvious to a person skilled in tho art 

"A' document member of the same patent family 



Date of the actual completion of the international search 
10 FEBRUARY 1999 



Date of mailing of the international search report 

1 4 MAY 1999 



Name and mailing address of the ISA/US 
Commissioner of Patents and Trademarks 
Box PCT 

Washington, D.C. 20231 
Facsimile No. (703) 305*3230 



Authorized officer 
Frank J. Asta 
Telephone No. (703) 305-3817 



3e*i Hill 



Form PCT/ISA/210 (second sheet)(July 1992)* 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US98/26233 



C (Continuation). DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 


Citation of document, wich indication, when appropriate, of the relevant passages 




A,E 


US 5,826,022 A (NIELSEN) 20 October 1998, Abstract, Col. 3, 
lines 22- 41, Col. 5, lines 40-67, Col. 6, lines 1-32, Col. 7, lines 1- 
22 

US 5,864,684 A (NIELSEN) 26 January 1999, Col. 3, lines 11-37 


1-31 
1 



Form PCT/ISA/210 (continuation of second sheet)(July 1992)* 



